Sir:

I, J. Hans van de Sande, Ph.D., a citizen of Canada, hereby declare and
state:

1. The curriculum vitae attached as Exhibit A accurately reflects my 
professional credentials. As noted in Exhibit A, I am presently Vice-Dean of 
the Faculty of Medicine as well as a Professor in the Department of 
Biochemistry & Molecular Biology (formerly known as the Department of Medical 
Biochemistry) at the University of Calgary in Calgary, Alberta, Canada. 

2. I am a paid consultant of Ingeneus Corp. through my association 
with Genetic Diagnostics, Inc., a licensee of technology owned by Ingeneus 
Corp. I expect to be compensated for my time expended preparing this
document.

3. Prior to executing this Declaration, I reviewed the
above-identified application, the June 5, 2002 Final Rejection, the January
18, 2002 DECLARATION UNDER 37 C.F.R. § 1.132 of Jasmine I. Daksis, and the 
January 21, 2002 DECLARATION UNDER 37 C.F.R. § 1.132 of Richard A. Collins. 

4. The purpose of this Declaration is to address the assertions in
the Final Rejection that: (a) the application does not enable one skilled in
the art to make and/or use the invention; and (b) the claimed multiplex
structure lacks patentable utility.

Enablement 

5. Counsel for Ingeneus Corp. has advised me that an invention is 
patentably enabled if one of ordinary skill in the art could make or use the 
invention from the disclosures in the patent application coupled with 
information known in the art without undue experimentation. While I am not 
an expert in patent law, my experience and educational background, 
particularly as a professor of molecular biology and biochemistry, enable me 
to render an informed opinion as to the facts underlying the determination of 
enablement, including the level of ordinary skill in the art, information 
known in the art at the time of the invention, and what constitutes undue 
experimentation to one of ordinary skill in the art. 

6. Composition claim 1 specifies a catalytic hybridization 
composition comprising a probe, an enzyme, a target and a hybridization 
medium, wherein at least one of the probe and the target is double-stranded
and is bonded to the other of the probe or the target solely through 
Watson-Crick base triplets. Method claim 24 specifies a catalytic 
hybridization method for assaying binding, comprising the formation of 
probe-target multiplexes like those in claim 1. The meaning of Watson-Crick 
base triplets in the context of the invention is provided in the application 
at page 12, lines 17-22: 

In certain triplex and quadruplex embodiments, each nucleobase
binds to no more than two other nucleobases. Thus, in addition
to the traditional Watson-Crick base pairs, such embodiments
include the following Watson-Crick base triplets: A-T-A, T-A-T,
U-A-T, T-A-U, A-U-A, U-A-U, G-C-G and/or C-G-C (including C+-G-C,
and/or any other ionized species of base).

in view of the following passage of the application at page 11, lines 17-24:

As used herein, the term "Watson-Crick bonding" is intended to 
define specific association between opposing pairs of nucleic 
acid (and/or nucleic acid analogue) strands via matched, opposing 
bases. While the formation of a Watson-Crick quadruplex may 
sometimes be referred to as a hybridization event herein, that is 
merely for convenience and is not intended to limit the scope of 
the invention with respect to how the formation of a Watson-Crick 
quadruplex can be best characterized. 

One of ordinary skill in the art would have understood from the foregoing
that the probe-target multiplexes of claims 1 and 24 are triplexes of three
nucleobase-containing strands and quadruplexes of four nucleobase-containing
strands, wherein Adenines align with Thymines (or Uracils) and Cytosines
align with Guanines.
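
The base-triplet list quoted above from page 12 of the application can be sketched as a simple lookup. The following Python fragment is a minimal illustration only. It assumes that the three strands are written so that aligned nucleobases share the same index, and the function names are hypothetical and do not appear in the application.

```python
# Illustrative sketch: encodes the Watson-Crick base triplets listed in the
# quoted passage (A-T-A, T-A-T, U-A-T, T-A-U, A-U-A, U-A-U, G-C-G, C-G-C).
WATSON_CRICK_TRIPLETS = {
    ("A", "T", "A"), ("T", "A", "T"), ("U", "A", "T"), ("T", "A", "U"),
    ("A", "U", "A"), ("U", "A", "U"), ("G", "C", "G"), ("C", "G", "C"),
}


def is_watson_crick_triplet(first, middle, third):
    """Return True if the three aligned bases form one of the listed triplets."""
    return (first.upper(), middle.upper(), third.upper()) in WATSON_CRICK_TRIPLETS


def strands_form_listed_triplets(strand1, strand2, strand3):
    """Check three equal-length strands position by position.

    Assumption (not from the application): index i of each strand holds the
    bases that face one another in the multiplex.
    """
    if not (len(strand1) == len(strand2) == len(strand3)):
        return False
    return all(
        is_watson_crick_triplet(a, b, c)
        for a, b, c in zip(strand1, strand2, strand3)
    )
```

In every listed triplet the middle base is the Watson-Crick complement of both of its neighbors, which is the A:T(U) and C:G alignment described in the preceding paragraph.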

7. Applicants have shown through binding studies specific 
association of non-denatured dsDNA targets with non-denatured dsDNA probes to 
form Watson-Crick multiplexes. The concentration of KCl used in, e.g., 
Example 1 of the parent application (09/664,827) was sufficiently high that 
it was highly unlikely that strand displacement occurred or that the dsDNA 
probes or dsDNA targets were denatured in any way. It is known that high 
concentrations of salt (e.g., 100 mM NaCl) inhibit strand invasion by virtue 
of the salt increasing the stability of dsDNA. See, e.g., Tomac et al.,
"Ionic Effects on the Stability and Conformation of Peptide Nucleic Acid
Complexes," J. Am. Chem. Soc. 118, 5544-5552 (1996) (previously made of
record in the application on January 18, 2002). Accordingly, one of ordinary
skill in the art would have expected the 100 mM KCl concentration of
Example 1 to prevent strand displacement and denaturation.

8. In light of Applicants' evidence that two strands on opposing
non-denatured duplexes interact specifically, A:T(U) and C:G, one of
ordinary skill in the art would have found it reasonable to infer that
adjacent bases in the remaining two strands of the duplexes would be brought
into close enough proximity by the initial pairing of opposing strands to
interact specifically, A:T(U) and C:G. An ordinarily skilled artisan would
not have considered this inference unreasonable, particularly in view of
the teachings of, e.g., Zhang et al., "Dimeric DNA Quadruplex Containing
Major Groove Aligned A·T·A·T and G·C·G·C Tetrads Stabilized by
Inter-subunit Watson-Crick A·T and G·C Pairs," 312 J. Mol. Biol. 1073-88
(Oct. 5, 2001), attached as Exhibit B.

9. Zhang et al. disproves the theory in the Final Rejection at
page 6, last sentence, that "Watson-crick hydrogen bonding surfaces are
inaccessible for any other strands [i.e., strands other than the two
hybridized strands of conventional duplexes] since two strands are already
interacting with each other at the center of the double helix." Zhang et
al. shows through NMR studies the formation of A-T-A-T tetrads similar to
previously discovered G-C-G-C tetrads. Zhang et al. at pages 1073-74 states:

[E]fforts have been made to identify and characterize G·C·G·C
tetrads, where a pair of Watson-Crick G·C pairs can potentially
align either through their major groove or their minor groove
edges. . . . recent studies have demonstrated that G·C·G·C
tetrads aligned through their major groove edges can switch
between two distinct alignment geometries [shown in Figure 1(a)
and 1(b)]. . . . The major groove-aligned G·C·G·C tetrad has now
been observed in a range of DNA quadruplexes and appears to be a
robust tetrad motif adopted by a wide range of DNA sequences.

Figure 1 of Zhang et al. shows how major groove-aligned G·C·G·C and A·T·A·T
tetrads in their direct alignment geometry have each G hydrogen bonded to
each C, and each A hydrogen bonded to each T. Thus, contrary to the Final
Rejection, Zhang et al. and the art cited therein show that quadruplex
G-C-G-C and A-T-A-T binding is reasonably credible.

10. Thus, one of ordinary skill in the art (who has a high level of
skill and a high tolerance for complex experimentation) would have been able 
to make and use with no more than routine experimentation compositions 
comprising Watson-Crick multiplexes for specific and useful purposes such as 
the catalytic hybridization assay of claim 24 without ever knowing for 
certain the location, length and/or number of hydrogen bonds between adjacent 
bases in the multiplex. 

Utility 

11. As shown by Zhang et al., the art recognizes the existence of
quadruplex G-C-G-C and A-T-A-T binding under certain conditions, contrary to 
the assertion in the Final Rejection. Applicants have shown through binding 
studies specific association of non-denatured dsDNA targets with 
non-denatured dsDNA probes, wherein the targets and probes align Adenine to 
Thymine (or Uracil) and Cytosine to Guanine. Accordingly, one of ordinary 
skill in the art would have found the claimed invention to be reasonably 
credible in view of the original disclosure of the invention and conventional 
wisdom in the art. 

12. Similarly, the binding studies described in the previously filed 
Daksis and Collins Declarations further show the reasonable credibility of 
the triplex embodiments of this invention, as do the inventors' previously 
issued U.S. Patent Nos. 6,420,115, 6,403,313 and 6,265,170, which include
many examples of Watson-Crick triplex binding, wherein Adenines align with 
Thymines (or Uracils) and Cytosines align with Guanines. 

13. In addition, I have personally observed triplex hybridization 
experiments similar to those described in the working examples of the 
application, wherein single-stranded probes were able to discriminate between 
perfectly matched duplex targets and mismatched duplex targets under
non-denaturing conditions. The experiments further convinced me of the high
degree of recognition between the single-stranded probe and the duplex target
with complete base pairing between binding partners.

I hereby declare that all statements made herein of my own knowledge 
are true, and that all statements made on information and belief are believed 
to be true; and further that these statements were made with the knowledge 
that willful false statements and the like so made are punishable by fine 
and/or imprisonment under Section 1001 of Title 18 of the United States Code, 
and that such willful false statements may jeopardize the validity of the 
application or any patent issuing therefrom.

National Research Council of Canada Studentship

We report on an NMR study of unlabeled and uniformly ¹³C,¹⁵N-labeled
d(GAGCAGGT) sequence in 1 M NaCl solution, conditions under which
it forms a head-to-head dimeric quadruplex containing sequentially
stacked G·C·G·C, G·G·G·G and A·T·A·T tetrads. We have identified,
for the first time, a slipped A·T·A·T tetrad alignment, involving
recognition of Watson-Crick A·T pairs along the major groove edges of
opposing adenine residues. Strikingly, both Watson-Crick G·C and A·T
pairings within the direct G·C·G·C and slipped A·T·A·T tetrads,
respectively, occur between rather than within hairpin subunits of the
dimeric d(GAGCAGGT) quadruplex. The hairpin turns in the head-to-head
dimeric quadruplex involve single adenine residues and add to our
knowledge of chain reversal involving edgewise loops in DNA
quadruplexes. Our structural studies, together with those from other
laboratories, definitively establish that DNA quadruplex formation is not
restricted to Gn repeat sequences, with their characteristic stacked
uniform G·G·G·G tetrad architectures. Rather, the quadruplex fold is a
more versatile and robust architecture, accessible to a range of mixed
sequences, with the potential to facilitate G·C·G·C and A·T·A·T tetrad
through major and minor groove alignment, in addition to G·G·G·G tetrad
formation. The definitive experimental identification of such major
groove-aligned mixed A·T·A·T and G·C·G·C tetrads within a quadruplex
scaffold has important implications for the potential alignment of
duplex segments during homologous recombination.

© 2001 Academic Press

Keywords: A·T·A·T and G·C·G·C tetrads; dimeric DNA quadruplex;
hydrogen bond alignments; inter-subunit Watson-Crick pairs



Introduction 

There is an increasing appreciation of the role
DNA quadruplexes (reviews¹⁻⁴) may play in biological
processes ranging from replication, transcription
and recombination (review⁵) to telomere
function (review⁴). The earliest research focused on
quadruplex formation involving stacked G·G·G·G
tetrads (review), with polymorphism introduced
by the directionality of adjacent strands around the
quadruplex (review⁴).

Formation of mixed tetrads could dramatically
increase the versatility of quadruplex formation by
allowing adoption of this four-stranded architecture
by sequences other than simple Gn repeats.
Towards this end, efforts have been made to identify
and characterize G·C·G·C tetrads, where a
pair of Watson-Crick G·C pairs can potentially
align either through their major groove or their
minor groove edges. This approach was successful
in the case of the Fragile X syndrome triplet
repeat-containing d(GCGGT₃GCGG) sequence,
which dimerizes in solution through head-to-tail
alignment of hairpins. The resulting quadruplex
contains G·C·G·C tetrads, through major groove
alignment of Watson-Crick G·C pairs (Figure 1(a)).⁷

Figure 1. Potential G·C·G·C tetrad formation involving either (a)
direct or (b) slipped alignment of major groove edges of a pair of
Watson-Crick G·C pairs. The slipped alignment requires a monovalent
cation to coordinate the inwardly oriented acceptor atoms.
Potential A·T·A·T tetrad formation involving either (c) direct or (d)
slipped alignment of major groove edges of a pair of Watson-Crick
A·T pairs.



Independent structural studies have identified
minor groove-aligned G·C·G·C tetrads (Supplementary
Material, Figure S1(a)) in the crystalline
and solution states.

More recent studies have demonstrated that
G·C·G·C tetrads aligned through their major
groove edges can switch between two distinct
alignment geometries. In the direct alignment
(Figure 1(a)), the two G·C pairs align directly
opposite each other, resulting in the acceptor
atoms (O6 and N7) of G pairing with the donor
group (NH₂-4) of C. Alternately, in the slipped
alignment (Figure 1(b)), the major groove edges of
two guanine bases are positioned opposite each
other. In this case, a monovalent cation (not necessarily
in the plane of the tetrad) is needed to coordinate
the four inwardly pointing nitrogen and
oxygen acceptor atoms on the guanine bases
(Figure 1(b)). Solution structural studies of the
adeno-associated viral repeat-containing
d(GGGCT₄GCGC) sequence have demonstrated
that the direct G·C·G·C tetrad alignment
(Figure 1(a)) is formed in Na cation solution,¹⁰
while the slipped G·C·G·C tetrad alignment
(Figure 1(b)) is formed in K cation solution. Such
conformational transitions were unanticipated, and
attest to the diversity of pairing geometries in
mixed tetrad alignments and the role of cations in
modulating this transition.

The major groove-aligned G·C·G·C tetrad has
now been observed in a range of DNA
quadruplexes⁷,¹⁰⁻¹² and appears to be a robust tetrad
motif adopted by a range of DNA sequences. One
can therefore anticipate potential formation of the
corresponding major groove-aligned A·T·A·T
counterpart, either of the direct (Figure 1(c)) or
slipped (Figure 1(d)) alignment type. By contrast,
the corresponding minor groove-aligned A·T·A·T
tetrad, of the slipped cation-coordination type
(Supplementary Material, Figure S1(b)), has been
reported earlier both in the crystalline and
solution⁹ states.

Demonstration of the major groove-aligned
A·T·A·T tetrad formation has turned out to be a
considerable challenge, and after many unsuccessful
attempts, our group has now identified a
sequence, d(GAGCAGGT), which forms a head-to-head
dimeric quadruplex (Figure 2(a) and Supplementary
Material, Figure S2), stabilized by
direct G·C·G·C (Figure 2(b)) and slipped
A·T·A·T (Figure 2(d)) tetrads, flanking a central
G·G·G·G tetrad (Figure 2(c)). Strikingly, we
observe Watson-Crick G·C pairing between monomer
subunits within the G·C·G·C tetrad and Watson-Crick
A·T pairing between monomer subunits
within the A·T·A·T tetrad in this quadruplex.
Finally, the edgewise turns in the head-to-head
dimeric quadruplex involve a single adenine residue
and provide new insights into chain reversal
in DNA quadruplexes.



Results 

Our group scanned a range of sequences containing
G, A and T residues in our attempts to generate
a major groove-aligned A·T·A·T tetrad,
stabilized through stacking on a G·G·G·G tetrad
within a quadruplex scaffold. This approach has
worked in the past in our laboratory when we
have generated stable triads and
A·(G·G·G·G)·A hexads, stabilized through
stacking on G·G·G·G tetrads within a quadruplex
scaffold. We report on NMR studies of the
d(GAGCAGGT) sequence, which forms a dimeric
quadruplex containing a major groove-aligned slipped
A·T·A·T (Figure 2(d)) tetrad.

Figure 2. (a) Folding topology of a head-to-head dimeric
d(GAGCAGGT) quadruplex in 1 M NaCl. The backbone tracing of the
individual strands is shown by thick lines and the chain
directionality indicated by arrows. Pairing alignments for (b) the
G6·C4·G6·C4 tetrad, (c) the G3·G7·G3·G7 tetrad and (d) the
A2·T8·A2·T8 tetrad.



Imino proton spectra and assignments

The exchangeable proton spectrum (5.5 to 14.5
ppm) of the d(GAGCAGGT) sequence was highly
dependent on added NaCl concentration. At low
NaCl concentration, the predominant conformation
gave very broad resonances in slow exchange with
a minor component that gave narrow resonances
(Supplementary Material, Figure S3). The equilibrium
shifted with increasing NaCl to the conformation
that exhibited narrow resonances, resulting
in the spectrum shown in Figure 3(a) in 1 M NaCl,
2 mM phosphate buffer (pH 6.6) at 10 °C. This
spectrum exhibits well resolved exchangeable (8.0
to 15.0 ppm) and non-exchangeable (7.0 to 8.7
ppm) resonances, with the total number of imino,
amino and base proton resonances consistent with
formation of a single conformer in solution. We
observe two narrow imino protons between 11 and
12 ppm, a region characteristic of N-H···O hydrogen
bonds, and narrow resonances at 13.11 ppm
and 14.28 ppm, a region characteristic of N-H···N
hydrogen bonds⁴ (Figure 3(a)). We also observe
two narrow amino proton resonances between 8.0
and 8.5 ppm and two additional narrow amino
proton resonances downfield-shifted between 9.0
and 9.2 ppm (Figure 3(a)).

The exchangeable imino and amino protons
have been assigned following analysis of
two-dimensional data sets on unlabeled and uniformly
¹³C,¹⁵N-labeled d(GAGCAGGT) sequence. An
expanded 60 ms mixing time NOESY contour plot
of d(GAGCAGGT) in 1 M NaCl at 0 °C is plotted
in Figure 3(b). The NOE cross-peaks are labeled in
the Figure and the assignments are given in the
legend. The corresponding NOESY data set
recorded at a longer mixing time of 200 ms is
plotted in Supplementary Material, Figure S4, and
the cross-peak assignments are listed in the legend.
We could readily distinguish thymine from guanine
imino protons because of their distinct nitrogen
chemical shifts in a ¹H-¹⁵N HSQC spectrum
(Supplementary Material, Figure S5).

We establish formation of an A2·T8 Watson-Crick
base-pair based on NOEs between the imino
proton of T8 and the amino (peaks a and a′,
Figure 3(b)) and H2 (peak b, Figure 3(b)) protons
of A2. In addition, we unexpectedly observe NOEs
between the H8 and NH₂ protons of A2 (peaks n
and n′, Figure 3(b)), consistent with formation of a
major groove-aligned slipped A2·T8·A2·T8 tetrad,
schematically outlined in Figure 2(d).

We establish formation of a G6·C4 Watson-Crick
base-pair based on NOEs between the imino
proton of G6 and the amino protons of C4 (peaks c
and c′, Figure 3(b)). We also detect NOEs between
the amino protons of C4 and the H8 proton of G6
(peaks l and l′, Figure 3(b)), and between the H5
proton of C4 and the H8 proton of G6 (peak p,
Figure 3(b)), consistent with formation of a major
groove-aligned direct G6·C4·G6·C4 tetrad,
schematically outlined in Figure 2(b).

Figure 3. (a) Proton NMR spectrum (7.0 to 15.0 ppm) of the d(GAGCAGGT) quadruplex (8.8 mM in strands) in
1 M NaCl, 2 mM phosphate in H₂O (pH 6.6) at 10 °C. (b) Expanded NOESY (60 ms mixing time) contour plot [...]
and f, G7(NH1)-G7(NH₂-2); g (observable in 200 ms mixing time NOESY experiment), G7(NH1)-G3(H8); h,
G7(NH₂-2)-G7(NH₂-2); i, G3(NH₂-2)-G3(NH₂-2); j and j′, G3(NH₂-2)-G7(H8); k, C4(NH₂-4)-C4(NH₂-4); l and l′,
C4(NH₂-4)-G6(H8); m and m′, C4(NH₂-4)-C4(H5); n and n′, A2(NH₂-6)-A2(H8); o, A2(NH₂-6)-A2(NH₂-6); p,
G6(H8)-C4(H5).

The narrow G3 and G7 imino protons exhibit a
set of NOEs indicative of G3·G7·G3·G7 tetrad
formation, schematically outlined in Figure 2(c).
These include NOEs between the H8 proton of G7
with the imino (peak e, Figure 3(b)) and amino
(peaks j and j′, Figure 3(b)) protons of G3, and an
NOE between the H8 proton of G3 and the imino
proton of G7 (box g, Figure 3(b); observable in
200 ms NOESY spectrum, peak g, Supplementary
Material, Figure S4).

The NOE data outlined above for the d(GAGCAGGT)
sequence are consistent with formation of
slipped A2·T8·A2·T8 (Figure 2(d)), direct
G6·C4·G6·C4 (Figure 2(b)) and G3·G7·G3·G7
(Figure 2(c)) tetrads. This can be most readily
achieved by head-to-head dimerization to form a
quadruplex as shown schematically in Figure 2(a)
and Supplementary Material, Figure S2. The
exchangeable imino and amino proton chemical
shifts of the d(GAGCAGGT) quadruplex are listed
in Supplementary Material, Table S1.

Non-exchangeable proton spectra
and assignments

The non-exchangeable base and sugar protons
have been assigned following analysis of
two-dimensional data sets on unlabeled and uniformly
¹³C,¹⁵N-labeled d(GAGCAGGT) sequence. An
expanded 250 ms mixing time NOESY contour
plot of d(GAGCAGGT) quadruplex in 1 M NaCl,
²H₂O buffer at 10 °C is plotted in Figure 4(a), along
with a tracing of distance connectivities between
base protons and their own and 5′-flanking sugar
H1′ protons from G1 to T8 in the sequence. The G3
residue adopts a syn alignment based on the strong
H8 to its own H1′ NOE in a 50 ms mixing time NOESY
stacked plot (Figure 4(b)). The remaining sugar
proton chemical shifts were obtained from an analysis
of other regions of the through-space NOESY
contour plot and through-bond COSY and TOCSY
correlations of the assigned sugar H1′ protons with
the remaining sugar protons within individual
rings. The non-exchangeable base and sugar proton
chemical shifts of the d(GAGCAGGT) quadruplex
are listed in Supplementary Material, Table S2.
The H8 proton of A2 (8.71 ppm) is downfield-shifted
while the H6 (6.78 ppm) and H5 (5.15 ppm)
protons of C4 are somewhat upfield-shifted.

We can also correlate exchangeable imino protons
with their own H8 protons within individual
guanine bases, via through-bond correlations to
their C5 ring carbon atoms. Such a correlation
experiment on the sample of uniformly ¹³C,¹⁵N-labeled
d(GAGCAGGT) sequence is shown in
Figure 4(c).

Hydrogen bond alignments 

We have verified the formation of NOE-based
G6·C4·G6·C4 (Figure 2(b)), G3·G7·G3·G7
(Figure 2(c)) and A2·T8·A2·T8 (Figure 2(d)) tetrad
alignments, by identifying through-bond coupling
connectivities across N-H···N hydrogen bonds²¹⁻²⁵
within the folded architecture of the uniformly
¹³C,¹⁵N-labeled d(GAGCAGGT) quadruplex in 1 M
NaCl, H₂O buffer at 0 °C. The amino protons of
A2, G3 and C4 could be correlated with their
directly attached nitrogen atoms using this labeled
sample as shown in Supplementary Material,
Figure S6.

We observe a N-H···N hydrogen bond connectivity
between the N1H donor of G6 and the
N3 acceptor of C4 (peak 4, Figure 5(a)) in a
HNN-COSY experiment²⁴ recorded on the
d(GAGCAGGT) quadruplex. This connectivity provides
direct support for a Watson-Crick G6·C4 alignment
(observed couplings as shaded region in
Figure 2(b)). In addition, we observe HNN-COSY
coupling connectivities between the NH₂ protons
of C4 and the N7 of G6 (peaks 1 and 1′, Figure 5(b))
and four bond H(CN)N(H) coupling
connectivities²⁴ between the N4 of C4 and the H8 of
G6 (peak 3, Figure 5(c)). These observations verify
the direct alignment of opposing Watson-Crick
G6·C4 pairs through their major groove edges
(observed coupling as shaded region in
Figure 2(b)), to form the G6·C4·G6·C4 tetrad
outlined in Figure 2(b).

The observed H(CN)N(H) coupling
connectivities²⁴ between the H8 proton of G7 and the
N2 of G3 (peak 1, Figure 5(c)) and between the H8
proton of G3 and the N2 of G7 (peak 2, Figure 5(c)),
verify the proposed hydrogen bonding patterns
(shaded regions in Figure 2(c)) around the
G3·G7·G3·G7 tetrad.

We observe a N-H···N hydrogen bond connectivity
between the N3H donor of T8 and the N1
acceptor of A2 (peak 2, Figure 5(a)) in a HNN-COSY
experiment. This connectivity provides
direct support for a Watson-Crick A2·T8 alignment.
In addition, we observe HNN-COSY coupling
connectivities between the NH₂ protons of A2
and the N7 of A2 (peaks 2 and 2′, Figure 5(b)).
This connectivity could be either within an adenine
or between adenine bases positioned opposite each
other and hydrogen-bonded through their major
groove edges, as shown for the A2·T8·A2·T8 tetrad
alignment in Figure 2(d). We do not observe
the corresponding coupling connectivities between
the NH₂ and N7 of A5, and hence we favor the latter
explanation over the former. These observations
verify the slipped alignment of opposing Watson-Crick
A2·T8 pairs through their major groove
edges, to form the A2·T8·A2·T8 tetrad outlined in
Figure 2(d).

Intra-strand versus inter-strand NOE restraints

The 2-fold symmetry in the dimeric d(GAGCAGGT)
quadruplex fold makes it critical to
unambiguously differentiate between intra-strand
and inter-strand contributions for key NOEs that
define the folding topology in solution. We prepared
a sample containing an equimolar mixture
of unlabeled and uniformly ¹³C,¹⁵N-labeled
d(GAGCAGGT) sequences and recorded ¹⁵N-edited
(ω1), ¹³C,¹⁵N-purged (ω2) NOESY (100 ms
mixing time) spectra in 1 M NaCl-containing H₂O
buffer solution at 0 °C, to identify inter-strand
NOEs and differentiate them from their intra-strand
counterparts, for the 50 % component in the
mixture where the quadruplex contains one
unlabeled and one uniformly labeled strand. We
observe inter-strand NOEs between the imino proton
of T8 and the amino (peaks 1 and 1′,
Figure 6(a); peaks 1 and 1′, Figure 6(b)) and H2
(peak 2, Figure 6(b)) protons of A2, establishing
formation of Watson-Crick A2·T8 pairs between
hairpin subunits in the dimeric d(GAGCAGGT)
quadruplex. We also observe inter-strand NOEs
between the imino proton of G6 and the amino
protons of C4 (peaks 2 and 2′, Figure 6(a); peaks 3
and 3′, Figure 6(b)), establishing formation of
Watson-Crick G6·C4 pairs between hairpin subunits in
the dimeric d(GAGCAGGT) quadruplex.

Figure 4. (a) Expanded NOESY (250 ms mixing time) contour plot correlating NOEs between base and sugar H1′
protons in the d(GAGCAGGT) quadruplex (8.8 mM in strands) in 1 M NaCl, 2 mM phosphate, ²H₂O (pH 6.6) at
10 °C. The cytidine H6 and H5 NOEs are designated by asterisks. The NOE cross-peaks a and b are assigned as
follows: a, A2(H2)-A2(H1′); b, A5(H2)-A5(H1′). (b) Expanded NOESY (50 ms mixing time) stacked plot correlating
NOEs between base and sugar H1′ protons in the d(GAGCAGGT) quadruplex (8.8 mM in strands) in 1 M NaCl,
2 mM phosphate, ²H₂O (pH 6.6) at 0 °C. The cytidine H6 and H5 NOEs are designated by asterisks. (c) Correlation of
the guanine imino and H8 protons by through-bond connectivities to the C5 carbon within individual guanine rings
in the uniformly ¹³C,¹⁵N-labeled d(GAGCAGGT) quadruplex (5.3 mM in strands) in 1 M NaCl, 2 mM phosphate,
H₂O (pH 6.6) at 0 °C.

Figure 5. Identification of N-H···N hydrogen bond alignments
in uniformly ¹³C,¹⁵N-labeled d(GAGCAGGT) (5.3 mM in
strands) in 1 M NaCl, 2 mM phosphate, H₂O (pH 6.6) at 0 °C.
Expanded 2D HNN-COSY contour plots correlating two-bond
coupling connectivities between donor and acceptor nitrogen atoms
within N-H···N hydrogen-bond alignments across base-pairs. (a)
Two-bond coupling connectivities between T8(N3) imino donor and
A2(N1) acceptor nitrogen atoms (peak 2) across the A2(N1)·T8(N3
donor) Watson-Crick base-pair and between G6(N1) imino donor and
C4(N3) acceptor nitrogen atoms (peak 4) across the C4(N3)·G6(N1
donor) Watson-Crick base-pair. (b) Two-bond coupling connectivities
between C4(N4) amino donor and G6(N7) acceptor nitrogen atoms
(peaks 1,1′) across C4(N4 amino)·G6(N7) mismatch pair and
between A2(N6) amino donor and A2(N7) acceptor nitrogen atoms
(peaks 2,2′) across A2(N6 amino)·A2(N7) mismatch pair. (c)
H(CN)N(H) spectrum showing inter-nucleotide H8(ω2)-N2(ω1) and
H8(ω2)-N4(ω1) cross-peaks. The spectrum consisted of 608 (t2) × 60
(t1) complex points, with 160 transients per FID. Spectral widths of
2000 Hz (t1max: 30 ms) and 8000 Hz (t2max: 76 ms) were used,
with a relaxation delay of two seconds, resulting in a total
acquisition time of 12 hours. Coupling connectivities are observed
between the H8 of G7 and the N2 of G3 (peak 1), between the H8 of G3
and the N2 of G7 (peak 2), as well as between the H8 of G6 and N4 of
C4 (peak 3). A coupling connectivity was not detectable between
the H8 of A2 and the N6 of A2.

We also observe inter-strand NOEs between the
amino protons of A2 and the H8 proton of A2
(peaks 4 and 4′, Figure 6(a)), putting restraints on
the relative alignments of the A·T base-pairs
around the A·T·A·T tetrad. Inter-strand NOEs are
also observed between the imino proton of G7 and
the H2 proton of A2 (peak 4, Figure 6(b)), putting
restraints on the relative stacking of adjacent
G3·G7·G3·G7 and A2·T8·A2·T8 tetrads in the
quadruplex.

Distance restraints and molecular
dynamics calculations

Distance restraints associated with exchangeable
protons (total of 115) were qualitatively deduced
from NOESY experiments in H₂O at two mixing
times, while those associated with their
non-exchangeable proton counterparts (total of 178)
were quantified from NOE buildup curves in ²H₂O
at four mixing times, as outlined in Materials and
Methods. The observation of a single set of narrow
resonances for the d(GAGCAGGT) sequence at
temperatures down to 0 °C was consistent with
formation of a dimeric quadruplex, containing two
strands related by a 2-fold symmetry axis. Therefore,
non-crystallographic symmetry restraints
were used during the computations. All distance
restraints were classified as ambiguous during the
distance-restrained molecular dynamics computations.
Experimentally defined hydrogen bonding
alignments from NOE patterns on unlabeled
sample, N-H···N scalar couplings on isotopically
labeled sample, and intermolecular NOEs on an
equimolar mixture of labeled and unlabeled
sample, were used to restrain the G6·C4·G6·C4
(Figure 2(b)), G3·G7·G3·G7 (Figure 2(c)) and
A2·T8·A2·T8 (Figure 2(d)) tetrads, with the folding
models retaining these hydrogen bonding
alignments during the computations.

The solution structure of the dimeric d(GAGCAGGT)
quadruplex in 1 M NaCl was solved by
molecular dynamics computations guided by
hydrogen bonding and NOE distance restraints.
Sixty starting structures were generated for the
d(GAGCAGGT) 8-mer segment as sets of pairs of
randomized chains separated by space intervals of
50 Å. The protocol outlined in Materials and
Methods involved initial torsion space dynamics at
20,000 K followed by Cartesian space dynamics at
300 K. A subset of ten distance-refined structures
of the d(GAGCAGGT) quadruplex were identified
based on a combination of low NOE energies and
fewest NOE violations.

Intensity restraints and NOE back calculations

The subset of ten converged distance-refined
structures was next refined against the
non-exchangeable proton NOE intensities associated
with NOESY spectra recorded at four mixing
times. These computations utilized a molecular
dynamics with back calculation protocol outlined
in Materials and Methods. The NOE violations,
deviations from covalent geometry and pairwise
r.m.s.d. values for the ten lowest energy
intensity-refined structures of the d(GAGCAGGT)
quadruplex (less the poorly defined G1 residues) are listed
in Table 1.

Structural features

A stereo view of the ten superpositioned lowest
energy intensity refined structures of the
d(GAGCAGGT) quadruplex (less the poorly defined G1
residues) is shown in Figure 7(a). The
sugar-phosphate backbones of individual symmetry-related
hairpins are colored in orange and green, with the
sequentially stacked loop A5 residue,
G6·C4·G6·C4 tetrad, G3·G7·G3·G7 tetrad and
A2·T8·A2·T8 tetrad colored in white, magenta,
yellow and cyan, respectively. Ribbon and surface
views of one representative refined structure of the
d(GAGCAGGT) quadruplex using the GRASP
program are plotted in Figure 7(b) and (c),
respectively.

A stick representation of one symmetric strand
of the d(GAGCAGGT) quadruplex (less the poorly
defined G1 residues) is shown in Figure 8(a). The
pairing alignments of the G6·C4·G6·C4 tetrad and
A2·T8·A2·T8 tetrad in this representative
structure are shown in Figure 8(b) and (c), respectively.

The stacking geometry between the A5 bases
(in white) and the G6·C4·G6·C4 tetrad (in
magenta) is shown in Figure 9(a), while that
between the G6·C4·G6·C4 tetrad (in magenta)
and the G3·G7·G3·G7 tetrad (in yellow) is shown
in Figure 9(b). The overlap geometry between the
G3·G7·G3·G7 tetrad (in yellow) and the
A2·T8·A2·T8 tetrad (in cyan) is shown in
Figure 9(c).

Discussion 

The NMR-based structural characterization of
the d(GAGCAGGT) quadruplex was considerably
aided by the narrow and well-resolved resonances
of the one and two-dimensional proton NMR
spectra, and the availability of uniformly
13C,15N-labeled sequence, which aided greatly in the
resonance assignment and identification of hydrogen
bonding alignments. The computations were also
aided by our ability to distinguish between
inter-strand and intra-strand NOEs (and in turn,
Watson-Crick hydrogen bonding alignments) using
Table 1. NMR restraints and structural statistics for the intensity refined structures of the dimeric d(GAGCAGGT)
quadruplex

A. NMR restraints
Distance restraints (a)                              Non-exchangeable   Exchangeable
  Intra-residue distance restraints                  86                 19
  Sequential (i, i + 1) distance restraints          76                 45
  Long range (≥ i + 2) distance restraints           16                 51
Other restraints
  Hydrogen bonding restraints (b)                    48
  Torsion angle restraints (c)                       14
Intensity restraints
  Non-exchangeable protons (each of 4 mixing times)  178

B. Structural statistics in complex following intensity refinement
NOE violations
  Number > 0.2 Å                                     6.1 ± 0.7
  Maximum violations                                 0.27 ± 0.03
  r.m.s.d. of violations                             0.036 ± 0.02
  NMR R factor (R1/6)                                0.074 ± 0.005
Deviations from ideal covalent geometry
  Bond lengths                                       0.011 ± 0.001
  Bond angles                                        3.70 ± 0.09
  Impropers                                          0.40 ± 0.03
Pairwise all heavy atom r.m.s.d. values (10 refined structures)
  All heavy atoms excluding G1                       0.78 ± 0.17

(a) All distance restraints were treated as ambiguous between intra- and inter-residue contributions.
(b) These hydrogen bonding restraints are based on experimental NOE and 2JNN coupling data.
(c) Residues A2, C4, A5, G6, G7 and T8 were restrained to χ values in the 210(±40)° range, characteristic of anti glycosidic torsion
angles, while residues G3 were restrained to χ values in the 65(±40)° range, characteristic of syn glycosidic torsion angles, identified
experimentally.



equimolar mixtures of labeled and unlabeled
samples of the quadruplex.

Quadruplex architecture

The experimental data on the d(GAGCAGGT)
sequence are consistent with formation of a dimeric
quadruplex in 1 M NaCl solution. Individual
strands of d(GAGCAGGT) form hairpins, with a
single adenine, A5, involved in chain reversal.
Dimerization involves head-to-head orientation of
hairpins (Figures 2(a) and 7; Supplementary
Material, Figure S1), with individual strands
running antiparallel relative to their neighbors around
the quadruplex (Figure 2(a)).

G·G·G·G tetrad

We detect a G3(syn)·G7(anti)·G3(syn)·G7(anti)
alignment around the G·G·G·G tetrad in the
dimeric d(GAGCAGGT) quadruplex. Such an
alignment about the G·G·G·G tetrad is consistent
with antiparallel alignment of adjacent strands
around the quadruplex, and has been reported for
the solution structures of the thrombin-binding
DNA aptamer quadruplex,27,28 as well as the
d(GCGGT3GCGG) Fragile X syndrome-containing
sequence quadruplex 7 and the d(GGGCT4GGGC)
adeno-associated virus-containing sequence
quadruplex.10

G·C·G·C tetrad

The G6·C4·G6·C4 tetrad (Figures 2(b) and 8(b))
is defined by all anti glycosidic torsion angles,
consistent with antiparallel arrangement of adjacent
strands around the dimeric d(GAGCAGGT)
quadruplex. This type of direct G·C·G·C tetrad
alignment (Figure 1(a)) has been observed
previously for the solution structures of the
d(GCGGT3GCGG) Fragile X syndrome
sequence-containing quadruplex 7 and the d(GGGCT4GGGC)
adeno-associated virus sequence-containing
quadruplex.10 There is, however, a critical distinction in
that Watson-Crick G6·C4 pairs are formed
between rather than within hairpin subunits
(Figure 8(b)) in the head-to-head dimeric
d(GAGCAGGT) quadruplex reported here. This contrasts
with Watson-Crick G·C pairs within hairpin
subunits in the head-to-tail dimeric
d(GCGGT3GCGG) 7 and d(GGGCT4GGGC) 10
quadruplexes reported previously.

The amino protons of C4 in the d(GAGCAGGT)
quadruplex resonate at 8.49 ppm and 9.23 ppm.
The chemical shift of 8.49 ppm is characteristic of a
cytosine amino proton hydrogen-bonded to a
carbonyl oxygen atom, such as would occur with a
guanine residue in a Watson-Crick G·C pair. The
chemical shift of 9.23 ppm is consistent with the
other cytosine amino proton also being hydrogen
bonded, and its much larger downfield shift may
be reflective of its acceptors being both a ring
nitrogen and carbonyl oxygen, such as would occur in
a direct G·C·G·C tetrad (Figure 1(a)).

A·T·A·T tetrad

The slipped A2·T8·A2·T8 tetrad (Figures 2(d)
and 8(c)) contains all anti glycosidic torsion angles,
with hydrogen bonding along the major groove
Figure 6. Identification of inter-strand NOEs. An expanded 15N-edited (ω1), 13C,15N-purged (ω2) NOESY (100 ms
mixing time) contour plot of a 1:1 mixture of unlabeled and uniformly 13C,15N-labeled d(GAGCAGGT) (3.3 mM in
strands) in 1 M NaCl, 2 mM phosphate, H2O (pH 6.6) at 0°C. (a) Carrier is on amino 15N. [...] 4, G7(NH1)-A2(H2).



edges of opposing adenine bases. This is the first
report of any type (direct or slipped) of an
A·T·A·T tetrad alignment, which involves pairing
along the major groove edges. This A·T·A·T
pairing alignment brings the H8 proton of one adenine
in close proximity to the NH2 proton of its adenine
partner (Figure 1(d)), consistent with the observed
intermolecular NOEs between this pair of protons
in mixed labeling experiments (peaks 4 and 4',
Figure 6(a)). Equally important is our
demonstration that Watson-Crick A2·T8 pairs are formed
between rather than within hairpin subunits
(Figure 8(c)) in the head-to-head dimeric
d(GAGCAGGT) quadruplex.

The amino protons of A2 in the d(GAGCAGGT)
quadruplex resonate at 7.80 ppm and 9.16 ppm.
The chemical shift of 7.80 ppm is characteristic of
an adenine amino proton hydrogen-bonded to a
carbonyl oxygen atom, such as would occur with a
thymine residue in a Watson-Crick A·T pair. The
chemical shift of 9.16 ppm is consistent with the
other adenine amino proton also being hydrogen
bonded, and its much larger downfield shift is
reflective of its acceptor being a ring nitrogen, such
as would occur in a slipped A·T·A·T tetrad
(Figure 1(d)).



Single base chain reversal

Chain reversal involves a single adenine, A5,
within individual hairpin subunits (Figure 8(a)).
The A5 base is bracketed by a closing G6·C4
mismatch involving pairing between the amino group
of C4 and the acceptor atoms along the major
groove edge of G6 (Figure 2(b)). This concept of a
single base hairpin loop bracketed by a mismatch
pair was first reported for single base loops closed
by sheared G·A mismatches in antiparallel
duplexes.29,30 There is no pairing between A5
residues across from each other (Figure 9(a)) in the
head-to-head hairpin dimeric d(GAGCAGGT)
quadruplex. Rather, the A5 residues are stacked
over the adjacent C4 residues (Figure 9(a)).

We have checked for non-standard backbone 
torsion and sugar pucker pseudo-rotation (P) 
angles within the single residue edgewise turn 
spanning the C4-A5-G6 segment amongst the 
refined structures of the dimeric d(GAGCAGGT)
quadruplex. For the C4-A5 step, the P value for C4







Figure 7. (a) Superpositioned stereo view of ten intensity refined structures of the dimeric d(GAGCAGGT)
quadruplex. The backbones of the individual strands are in orange and green with phosphate oxygen atoms removed
for clarity. The A5 residue is in white, the G6·C4·G6·C4 tetrad in magenta, G3·G7·G3·G7 tetrad in yellow
and A2·T8·A2·T8 tetrad in cyan. (b) A GRASP ribbon view of a representative intensity refined structure of the
d(GAGCAGGT) quadruplex. The backbones of individual strands are colored in blue and red. Guanine, adenine and
thymine bases are colored yellow, red and blue, respectively. (c) A GRASP surface view of a representative intensity
refined structure of the d(GAGCAGGT) quadruplex.



of 69.8(±5.7)° places it in the C4'-exo range, the γ
value of 149.9(±40.1)° places it in the trans range
(in contrast to the standard gauche+ value of 64°), and
the χ value for A5 of 170.1(±4.6)° places it in the
low anti range.
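
The angle assignments above can be checked with a short sketch. It assumes the common textbook conventions of ten 36° pseudorotation bins starting at C3'-endo and three 120° torsion windows (gauche+, trans, gauche-); the function names are illustrative and are not taken from the refinement software. The anti window of 210(±40)° is the restraint range quoted in this paper.

```python
# Illustrative classification of sugar pucker and torsion angles.
# The bin boundaries below are assumed textbook conventions, not the
# authors' own analysis code.

PUCKER_BINS = [
    "C3'-endo", "C4'-exo", "O4'-endo", "C1'-exo", "C2'-endo",
    "C3'-exo", "C4'-endo", "O4'-exo", "C1'-endo", "C2'-exo",
]


def pucker_from_pseudorotation(p_deg: float) -> str:
    """Map a pseudorotation phase angle P (degrees) to a 36-degree pucker bin."""
    index = int((p_deg % 360.0) // 36.0)
    return PUCKER_BINS[min(index, 9)]  # guard against rounding up to 360


def torsion_class(angle_deg: float) -> str:
    """Classify a backbone torsion as gauche+, trans or gauche-."""
    angle = angle_deg % 360.0
    if angle < 120.0:
        return "gauche+"
    if angle < 240.0:
        return "trans"
    return "gauche-"


def within_window(angle_deg: float, center_deg: float, half_width_deg: float) -> bool:
    """True if the angle lies within center +/- half_width, with wraparound."""
    diff = (angle_deg - center_deg + 180.0) % 360.0 - 180.0
    return abs(diff) <= half_width_deg


if __name__ == "__main__":
    print(pucker_from_pseudorotation(69.8))   # P value reported for C4
    print(torsion_class(149.9))               # gamma value reported above
    print(within_window(170.1, 210.0, 40.0))  # chi of A5 against the anti window
```

With these conventions the reported χ of 170.1° sits just inside the lower edge of the 170° to 250° anti window, which matches its description as low anti.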



Base stacking

There is excellent pyrimidine/purine and
purine/purine stacking between the bases of the direct
G6·C4·G6·C4 tetrad (in magenta) and the
G3·G7·G3·G7 tetrad (in yellow) in the folded
topology of the d(GAGCAGGT) quadruplex
(Figure 9(b)). We observe partial stacking between
the bases of the G3·G7·G3·G7 tetrad (in yellow)
and the slipped A2·T8·A2·T8 tetrad (in cyan) in
the folded topology of the d(GAGCAGGT)
quadruplex (Figure 9(c)). Overall, there is significant
stacking of bases along the length of the dimeric
d(GAGCAGGT) quadruplex.

The stacking pattern between the G3·G7·G3·G7
tetrad (in yellow) and the slipped A2·T8·A2·T8
tetrad (in cyan) (Figure 9(c)) positions the imino
proton of G7 of one strand in close proximity to
the NH2 and H2 protons of A2 of the partner
strand, consistent with the observed inter-strand
NOEs (peaks 3 and 3', Figure 6(a); peak 4,
Figure 6(b)) in mixed labeling experiments.



Inter-strand NOEs 

We have observed a set of weaker inter-strand
NOEs, in addition to their stronger counterparts
shown in Figure 6(a) and (b), that merit further
discussion. The studies presented in Figure 6 on an
equimolar sample of unlabeled and uniformly
13C,15N-labeled d(GAGCAGGT) quadruplex in 1 M
NaCl-containing H2O buffer were recorded at
0°C. We have also collected a 15N-edited (ω1),
13C,15N-purged (ω2) NOESY 26 (100 ms mixing
time) spectrum, with the carrier on the imino 15N,
recorded at 10°C. This spectrum had improved
signal-to-noise, permitting data presentation at a
lower contour level (as shown in Supplementary
Material, Figure S7). We observe a set of weak
inter-strand NOE cross-peaks labeled 5 to 9 in
Supplementary Material, Figure S7, in addition to
stronger peaks 1 to 4, that were also seen in
Figure 6(b).

Peaks 5 and 5', assigned to inter-strand NOEs
between the imino proton of G7 and the amino
protons of C4, are consistent with the stacking
pattern shown in Figure 9(b). Peaks 6 and 6', assigned
to inter-strand NOEs between the imino proton of
G7 and the amino protons of A2, are consistent
with the stacking pattern shown in Figure 9(c).
Peak 9, assigned to an inter-strand NOE between
the imino of G6 and the H2 of A5, is consistent
with the stacking pattern shown in Figure 9(a).
Figure 8. (a) A stick representation of a symmetric
half (corresponding to one of the two strands) of a
representative intensity refined structure of the
d(GAGCAGGT) quadruplex. The color code is the same as
outlined in the legend to Figure 7. (b) Pairing alignment of
the G6·C4·G6·C4 tetrad. (c) Pairing alignment of the
A2·T8·A2·T8 tetrad.



By contrast, peak 7, assigned to an inter-strand
NOE between the imino of G3 and the H8 of A2,
cannot be explained by the overlap pattern in
Figure 9(c). Peak 8, representing an inter-strand
NOE assigned to the imino of G3 and the H8
proton of G6/G7, cannot be explained by the overlap
pattern shown in Figure 9(b). In both cases, while
the inter-strand distances between proton pairs are
long, their intra-strand counterparts are in close
proximity. Thus, it could be argued that perhaps
peaks 7 and 8 reflect relatively strong intra-strand
NOEs that are not completely removed by isotope
filtering. For this explanation to be valid, peaks 7
and 8 should have been doublets, because 13C



Figure 9. Base stacking overlap patterns in a
representative intensity refined structure of the d(GAGCAGGT)
quadruplex. (a) Stacking of A5 (in white) on the
G6·C4·G6·C4 tetrad (in magenta). (b) Stacking of the
G6·C4·G6·C4 tetrad (in magenta) on the G3·G7·G3·G7
tetrad (in yellow). (c) Stacking of the G3·G7·G3·G7
tetrad (in yellow) on the A2·T8·A2·T8 tetrad (in cyan).



decoupling was not applied during acquisition.
Currently, we are somewhat at a loss to account
for this discrepancy involving weak peaks 7 and 8
(Supplementary Material, Figure S7), whose
intensity did vary in the weak range when the
experiments were checked for reproducibility.

Salt dependence

The broad resonances for the d(GAGCAGGT)
sequence in low NaCl solution (Supplementary
Material, Figure S3(a)) are indicative of aggregate
formation. What was unexpected was the
transition to narrow resonances in 1 M NaCl solution
(Supplementary Material, Figure S3(b)), resulting
in the formation of a dimeric d(GAGCAGGT)
quadruplex, amenable to structural
characterization. Both the direct G·C·G·C (Figure 1(a)) and
slipped A·T·A·T (Figure 1(d)) tetrads have
inwardly pointing amino groups, an alignment not
conducive to coordinating monovalent cations
positioned either between or within tetrad planes.
Therefore, it is conceivable that 1 M NaCl has
minimal effect on the integrity of the dimeric
d(GAGCAGGT) quadruplex but for some
unknown reason may destabilize the aggregated
species that predominates in low salt solution.

Comparison with mixed tetrads involving
minor groove alignment of G·C and A·T pairs

Our structural studies on major groove-aligned
direct G·C·G·C and slipped A·T·A·T tetrads in
the head-to-head dimeric d(GAGCAGGT)
quadruplex (this study) can be compared with reported
structural studies on quadruplexes involving
minor groove-aligned direct G·C·G·C
(Supplementary Material, Figure S1(a)) and slipped
cation-coordinated A·T·A·T (Supplementary
Material, Figure S1(b)) tetrads formed by
dimerization of linear and cyclic octameric sequences.
Interestingly, the minor groove-aligned direct
G·C·G·C (Supplementary Material, Figure S1(a))
and slipped cation-coordinated A·T·A·T
(Supplementary Material, Figure S1(b)) tetrads also
involve Watson-Crick G·C and A·T pairing
between rather than within head-to-head hairpin
dimers of the quadruplex.

It should be noted that while the major
groove-aligned direct G·C·G·C (Figure 1(a)) and slipped
A·T·A·T (Figure 1(d)) tetrads adopt planar
geometries in the solution structure of the dimeric
d(GAGCAGGT) quadruplex (this study), the minor
groove-aligned direct G·C·G·C (Supplementary
Material, Figure S2(a)) and slipped
cation-coordinated A·T·A·T (Supplementary Material,
Figure S2(b)) tetrads adopt distinctly non-planar
geometries within their quadruplexes, both in the
crystalline 8,14 and solution 9 states.

Biological relevance

We have reported the first experimental
demonstration for formation of a major groove-aligned
slipped A·T·A·T tetrad (Figure 1(d)) within a
DNA quadruplex architecture. This result, together
with our previous demonstration of major
groove-aligned direct G·C·G·C tetrad (Figure 1(a)) 7
formation within a quadruplex architecture, expands
on the code for alignment of duplex segments that
can potentially participate in strand exchange
during homologous recombination. The glycosidic
bonds are directed outwards in major groove
aligned tetrads (Figure 1), while they are directed
inwards in minor groove-aligned tetrads
(Supplementary Material, Figure S1). There is steric
crowding between the inwardly pointing sugars in
the minor groove-aligned tetrads, resulting in
base-sugar and sugar-sugar interactions which severely
buckle the planes of the tetrads. By contrast,
there is no such crowding between the outwardly
pointing sugars in the major groove-aligned mixed
tetrads, resulting in a planar arrangement of bases
within the tetrad. Several challenges remain
despite our identification here of a major
groove-aligned slipped A·T·A·T tetrad (Figure 1(d)) at
one end of a quadruplex. Can one identify
formation of a major groove-aligned direct A·T·A·T
tetrad (Figure 3(c)), and can either the direct or
slipped A·T·A·T tetrads be positioned in the
interior of the quadruplex?

McGavin 31 was the first to recognise the
potential for self-pairing of Watson-Crick G·C and A·T
pairs to form dyad axes related major
groove-aligned G·C·G·C and A·T·A·T tetrads. Such
tetrad-based quadruplexes could play a role in strand
exchange between two homologous duplexes
during genetic recombination. Wilson 32 next
proposed a quadruplex-based model for formation of
reciprocal heteroduplexes from their homologous
duplex counterparts. In this model, homologous
duplexes initially associate through G·C·G·C and
A·T·A·T tetrad formation along their minor
groove edges, placing complementary strands
opposite each other. Strand exchange can then
occur following a 90° rotation of each base, such
that the base-pairs now face each other through
G·C·G·C and A·T·A·T tetrad formation along
their major groove edges. Experimental studies
have provided evidence for quadruplex formation
by poly(CA)n·poly(TG)n repeats, while
computational studies have established that such strand
exchange quadruplexes, as well as
quadruplex-duplex junctions, are stereochemically robust and
form stable entities. Thus, a future challenge
would be to design a DNA quadruplex consisting
solely of G·C·G·C and A·T·A·T tetrads and
attempt to trap structures where the alignment is
through the minor groove on the one hand, and
through the major groove on the other.



Materials and Methods 

Preparation of unlabeled and uniformly
13C,15N-labeled DNA

The unlabeled d(GAGCAGGT) sequence was
synthesized on a 10 μmol scale on an Applied Biosystems
392 DNA synthesizer using solid phase β-cyanoethyl
phosphoramidite chemistry and was subsequently
purified by high pressure liquid chromatography (HPLC).

A modified version of the Zimmer & Crothers
procedure, as described,19,20 was used for the enzymatic
synthesis of the uniformly 13C,15N-labeled d(GAGCAGGT)
sequence. The in-house prepared uniformly
13C,15N-labeled dNTPs 19,20 were used as building blocks in the
in vitro polymerization reaction catalyzed by Moloney
murine leukemia virus (MMLV) reverse transcriptase
(Gibco-BRL). The uniformly 13C,15N-labeled
d(GAGCAGGT) 8-mer was separated from the unlabeled
24-mer template using 22% (w/v) denaturing
polyacrylamide gel electrophoresis. The DNA 8-mer bands were
eluted from the gel by the "crush and soak" procedure and
purified as described above for the non-labeled samples.

NMR data collection and processing

NMR data on the d(GAGCAGGT) 8-mer in H2O and
2H2O buffer (1 M NaCl, 2 mM phosphate (pH 6.6)) were
collected on a Varian 600 MHz Unity Inova NMR
spectrometer. Proton assignments are based on homonuclear
NOESY, correlation spectroscopy (COSY), TOCSY and
HCCNH-TOCSY experiments. Data sets were processed
and analyzed using the FELIX program (Molecular
Simulations).

Two-bond 2JNN scalar couplings between imino and
amino donors and nitrogen acceptors in uniformly
13C,15N-labeled d(GAGCAGGT) in 1 M NaCl were
monitored in HNN-COSY 21-23 and H(CN)N(H) 24,25 contour
plots using pulse sequences described in the literature.



Distance restraints

The distances between non-exchangeable protons
were estimated from the buildup curves of cross-peak
intensities in NOESY spectra at four different mixing
times (50, 150, 200 and 250 ms) in 2H2O and given
bounds of ±30% with distances referenced relative to
the cytosine H6-H5 distance of 2.47 Å. Exchangeable
proton restraints are based on NOESY data sets at two
mixing times (60 and 200 ms) in H2O. Cross-peaks
involving exchangeable protons were classified as strong
(medium to strong intensity at 60 ms), medium (weak
intensity at 60 ms) and weak (observed only at a mixing
time of 200 ms) and proton pairs were then restrained,
respectively, to distances of 3.0(±0.9) Å, 4.0(±1.2) Å and
6.0(±1.6) Å. Since the experimental NMR data are
consistent with a dimeric quadruplex motif,
non-crystallographic symmetry restraints were imposed on all heavy
atoms.
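
The restraint scheme described above can be sketched as follows. This is a minimal illustration rather than the authors' processing code: the r^-6 scaling against the reference pair (the isolated spin-pair approximation) is an assumption about how buildup rates were converted to distances, the helper names are hypothetical, and the weak-class tolerance is taken as printed.

```python
from typing import NamedTuple


class Restraint(NamedTuple):
    target: float  # target distance, angstroms
    lower: float   # lower bound, angstroms
    upper: float   # upper bound, angstroms


# Cytosine H6-H5 reference distance quoted in the text (angstroms).
REFERENCE_DISTANCE = 2.47

# Exchangeable-proton classes from the text: (target, tolerance) in angstroms.
EXCHANGEABLE_CLASSES = {
    "strong": (3.0, 0.9),   # medium to strong intensity at 60 ms
    "medium": (4.0, 1.2),   # weak intensity at 60 ms
    "weak": (6.0, 1.6),     # observed only at 200 ms; tolerance as printed
}


def exchangeable_restraint(intensity_class: str) -> Restraint:
    """Look up the bounds for a qualitatively classified cross-peak."""
    target, tolerance = EXCHANGEABLE_CLASSES[intensity_class]
    return Restraint(target, target - tolerance, target + tolerance)


def distance_from_buildup(rate: float, reference_rate: float) -> float:
    """Assumed conversion: initial buildup rate scales as r**-6."""
    return REFERENCE_DISTANCE * (reference_rate / rate) ** (1.0 / 6.0)


def nonexchangeable_restraint(rate: float, reference_rate: float,
                              fraction: float = 0.30) -> Restraint:
    """Distance from the buildup rate, with the +/-30% bounds from the text."""
    r = distance_from_buildup(rate, reference_rate)
    return Restraint(r, r * (1.0 - fraction), r * (1.0 + fraction))


if __name__ == "__main__":
    print(exchangeable_restraint("medium"))
    # A cross-peak building up 64 times more slowly than the reference pair.
    print(nonexchangeable_restraint(rate=1.0, reference_rate=64.0))
```

Because of the sixth-root dependence, a 64-fold drop in buildup rate only doubles the estimated distance, which is why fairly generous ±30% bounds are still informative.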



Structure calculations 

The structure of the d(GAGCAGGT) quadruplex in 1 M NaCl was
determined by molecular dynamics (MD)-simulated
annealing computations driven by NOE distance and
hydrogen bonding restraints using the X-PLOR package,
version 3.8. At the initial stage of the refinement,
torsional molecular dynamics was undertaken at high
temperature. The molecules were equilibrated at 20,000 K
(30,000 steps over 3 ps) and then cooled very slowly to
1000 K (40,000 steps over 20 ps). The potential energy
function included a repulsive force field, NOE and
hydrogen bond distance restraints, glycosidic bond (χ)
dihedral angle restraints and a non-crystallographic
symmetry potential. The force constant for NOE distance
restraints was maintained at a value of 30 kcal mol⁻¹
Å⁻², while for hydrogen bond restraints the value was
50 kcal mol⁻¹ Å⁻². All NOE distance restraints were
considered as ambiguous and treated with the "sum"
averaging option. Dihedral angle restraints (210(±40)°,
with force constant of 50 kcal mol⁻¹ rad⁻²) were
imposed on glycosidic torsion angles for the residues A2,
C4, A5, G6, G7 and T8 shown experimentally to adopt
anti conformations. Dihedral angle restraints (65(±40)°,
with force constant of 50 kcal mol⁻¹ rad⁻²) were
imposed on glycosidic torsion angles for the G3 residue
shown experimentally to adopt syn conformation. The
force constant for non-crystallographic symmetry was
maintained at 30 kcal mol⁻¹ Å⁻².
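
To illustrate what treating a restraint as ambiguous with "sum" averaging means in a symmetric dimer, the sketch below combines the intra-strand and inter-strand candidate distances for one cross-peak into a single effective distance, using the r^-6 summation commonly associated with that option. The flat-bottomed harmonic penalty and both function names are assumptions made for illustration; they are not taken from the X-PLOR input used in this work.

```python
def sum_averaged_distance(distances):
    """Effective distance (sum over r**-6) ** (-1/6) for an ambiguous restraint."""
    return sum(r ** -6 for r in distances) ** (-1.0 / 6.0)


def flat_bottom_energy(distance, lower, upper, force_constant=30.0):
    """Assumed penalty: zero inside [lower, upper], harmonic outside.

    The default force constant is the 30 kcal/mol/A^2 value quoted in the text.
    """
    if distance < lower:
        deviation = lower - distance
    elif distance > upper:
        deviation = distance - upper
    else:
        deviation = 0.0
    return force_constant * deviation ** 2


if __name__ == "__main__":
    # One cross-peak, two candidate proton pairs: intra-strand 9.0 A,
    # inter-strand 3.2 A. The short contribution dominates.
    effective = sum_averaged_distance([9.0, 3.2])
    print(round(effective, 3))
    print(flat_bottom_energy(effective, lower=2.1, upper=3.9))
```

The effective distance is always slightly shorter than the shortest candidate, so the restraint is satisfied whenever either the intra-strand or the inter-strand pair is close, without the assignment having to be decided in advance.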

These computations were followed by lower
temperature Cartesian space molecular dynamics guided by the
hydrogen bonding and NOE distance restraints with
changes in the potential energy function: the repulsive
force field was replaced with Lennard-Jones potentials
and planarity restraints were included for tetrad planes
with low weights of 5 kcal mol⁻¹ Å⁻² and 10 kcal mol⁻¹
Å⁻², respectively. During this stage of the dynamics, the
structures were further cooled from 1000 K to 300 K
(20,000 steps over 10 ps) and minimized until the
gradient of energy was less than 0.1 kcal mol⁻¹. It should be
noted that computations repeated without planarity
restraints resulted in the same low energy structures, but
exhibited a lower convergence rate.

The refinement protocol started from 60 different
initial structures. The initial structures were generated as
sets of two chains, each eight nucleotides long,
randomized for all dihedral angles, and separated by space
intervals of 50 Å. The convergence rate following dynamics
was good for the case where the computations were
guided by hydrogen bonding restraints associated with
the topology shown in Figure 2(a): sixteen structures out
of 60 emerged with the same fold and pairwise r.m.s.d.
values less than 1.0 Å between members of the group.
Non-converged structures were separated from that
group by large gaps (in total more than 400 kcal) in all
components of the potential energy (van der Waals,
NOE violations, covalent geometry).

The ten converged distance refined structures
corresponding to the folding topology shown schematically in
Figure 2(a) were used as the starting point for
subsequent X-PLOR based energy minimization with back
calculation of the NOESY spectra. The relaxation matrix
was set up for the non-exchangeable protons, with the
exchangeable imino and amino protons exchanged for
deuterons. A total of 712 non-exchangeable intensity
values from NOESY data sets at four mixing times in
2H2O buffer (178 non-exchangeable intensities per
mixing time) were included with force constant of 500 kcal
mol⁻¹. The planarity restraints were set to very low
values of 1.0 kcal mol⁻¹ Å⁻² for the G·G·G·G tetrad and
0.5 kcal mol⁻¹ Å⁻² for the A·T·A·T and G·C·G·C tetrads. The
distance restraints were retained with 30% bounds and
the same weights as before. During minimization, the
NMR R factor (R1/6) improved from the initial value of
12% to 7.4% while retaining structure convergence and
stereochemistry.
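
For readers unfamiliar with the sixth-root R factor, the sketch below shows one common way such a residual is defined: the summed absolute difference between the sixth roots of observed and back-calculated intensities, normalized by the summed sixth roots of the observed intensities. The exact definition and any scaling between observed and calculated spectra used in the refinement are not given here, so this form, the function name and the example numbers are assumptions.

```python
def sixth_root_r_factor(observed, calculated):
    """Assumed R(1/6): sum |Io**(1/6) - Ic**(1/6)| / sum Io**(1/6).

    Intensities must be positive; no scale factor between the observed
    and calculated sets is applied in this sketch.
    """
    if len(observed) != len(calculated):
        raise ValueError("observed and calculated lists differ in length")
    numerator = sum(abs(o ** (1.0 / 6.0) - c ** (1.0 / 6.0))
                    for o, c in zip(observed, calculated))
    denominator = sum(o ** (1.0 / 6.0) for o in observed)
    return numerator / denominator


# Intensity restraints quoted in the text: 178 per mixing time, 4 mixing times.
INTENSITIES_PER_MIXING_TIME = 178
MIXING_TIMES = 4
TOTAL_INTENSITIES = INTENSITIES_PER_MIXING_TIME * MIXING_TIMES  # 712

if __name__ == "__main__":
    observed = [1.00, 0.50, 0.20, 0.05]    # made-up example intensities
    calculated = [0.90, 0.60, 0.15, 0.08]  # made-up back-calculated values
    print(round(sixth_root_r_factor(observed, calculated), 4))
    print(TOTAL_INTENSITIES)
```

Taking the sixth root before comparing puts strong and weak cross-peaks on a roughly distance-like scale, so a few intense peaks do not dominate the residual.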



Coordinates deposition 

Coordinates (accession number 1JVC) of the dimeric
d(GAGCAGGT) quadruplex have been deposited in the
RCSB Protein Data Bank.